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The difference between any eigenvector u v of a linear operators! and its Rayleigh-Ritz 
approximation w v is bounded in terms of the differences between the eigenvalues X; of A and 
their Rayleigh-Ritz upper bounds /c*. The bound for the difference between u v and w v 
approaches zero with k v —\ v . 



1. Introduction 

The most common method of approximating the 
eigenvalues Xi < X 2 < . . . of a symmetric linear oper- 
ator A is the Rayleigh-Ritz method [l^,^]. 1 This 
reduces an eigenvalue problem on a space of a large 
or even infinite number of dimensions to an eigen- 
value problem on a space of relatively few dimensions. 

If the desired eigenvalues X* are characterized as 
minima, the Rayleigh-Rifz approximations n t give 
upper bounds for them. 

Along with the upper bounds K t for the eigenvalues 
the Rayleigh-Ritz method yields associated vectors 
w t . It is to be expected that these vectors approxi- 
mate the eigenvectors of A in some sense. Further- 
more, it is to be expected that the better the eigenvalue 
Xi is approximated by K f , the better will be the 
approximation of w t to the corresponding eigen- 
vector. 

Indeed, this is easily seen in the case of the first 
eigenvector. If the unit vector w x is expanded in 
terms of the normalized eigenvectors u t of A, we have 



and 






(l.D 

(1.2) 
(1.3) 



Subtracting \ times (1.2) from (1.3) and noting that 
X 2 < X 3 < . . . we find that 



2 \X 2 — Ki 



a\>\ 



or equivalently 



Xi 



1 — ^{Wl—UijWi — U^ I 



i)J>: 



Xo — Xi 



(1.4) 



(1.5) 



In general the X* are unknown, and we must express 



1 Figures in brackets indicate the literature references at the end of this paper. 



our results in terms of the k { and any lower bounds 
\ t for the Xi that may be available. Such bounds 
can be obtained by various methods (see for example 
[1,2,3,4,5^6,10,11,12,13,14,19,20,21,22,23,24,25,26]). 
If Xi>Xi, the inequality (1.5) leads to 

[l-^^-u^-uolyi-^h- (1.6) 
L ^ J X 2 — Xi 

This inequality shows that if the maximum error 
/ci — Xi is small compared with the interval X 2 — Xi, the 
difference Wi—Ui is small in norm. The bound (1.6) 
is sharp in the sense that equality is attained when 
\t=\i and a,i=0 for £>2. The inequality (1.6) is 
trivial for K!>X 2 . 

In this paper we generalize the bound (1.6). We 
give a bound for the norm of w p — u p in terms of the 
given bounds K t and X*. This bound is again sharp 
in the sense that equality may be attained. The 
bound for _(w p —u p ,w p —u p ) is small if the maximum 
error k p —\ p is small relative to both X p+ i— k p and 
\ p — Kp_i. It becomes trivial if K p >\ p+i or /Cp_!>Xp. 
The case p=l gives an improved but more compli- 
cated version of (1.6). 

If X p is multiple eigenvalue, we can only expect 
w p to approximate one of the associated eigenvectors. 
Hence if \ p lies near to several other eigenvalues we 
must expect w p to approximate not u p but a linear 
combination of the eigenvectors corresponding to 
the nearby eigenvalues. This approximation is 
established in section 3. 

It is possible to find a bound for (w p —u p ,w p —u p ) 
by determining to what extent w p satisfies the 
eigenvalue equation. Such bounds, which involve 
(Awp—KpWpAwp—KpWp) have been found by several 
authors [12, 20, 23]. Our bound, however, involves 
only the K t and the lower bounds X*. It should be 
particularly useful in the case of differential opera- 
tors where the Rayleigh-Ritz trial functions may 
not be sufficiently differentiable to give a finite 
value of (Aw p Awp). Our bounds are established 
by algebraic means for the case when A is an NXN 
matrix. They are independent of N. Consequent- 
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ly, the bounds also hold for infinite-dimensional 
operators A whose first p eigenvalues and eigen- 
vectors are approximated uniformly by those of a 
sequence A N of NXN matrices. This is certainly 
the case if A is completely continuous. It also 
holds under the weaker condition that A have p 
discrete eigenvalues defined by a minimum maxi- 
mum principle. These must lie below any contin- 
uous spectrum. Thus A may be a Schroedinger 
operator corresponding to both bound and unbound 
states. 

The fact that the eigenvalues X* are stationary 
values of the Rayleigh quotient tends to make the 
approximation of the eigenvectors worse than that 
of the eigenvalues. In fact, the bound (2.41) shows 
that the square of the norm of the error w p —u p is 

of the order k p —\ p . 

The error bounds in sections 2 and 3 are in the 
sense of the norm. If A is a differential operator, 
its eigenvectors u t are functions. It is often of 
interest to approximate the value of the function at 
a particular point. An adaptation of the method 
of Diaz and Greenberg [7, 9] which leads to such a 
pointwise approximation is presented in section 4. 



2. Separated Eigenvalues 

Let A be an hermitian NXN matrix. It is a 
linear operator on Euclidean iV-space. Let the 
usual scalar product between two vectors u and v 
on this space be denoted by (u,v). 

Let the eigenvalues of A be \ <X 2 < . . . <Aat. 
The corresponding eigenvectors are denoted by 
u u . . . ,u N so that the equations 

Au i =\ i u i i=l, . . ., N (2.1) 

are satisfied. The u t are normalized so that 

(u i u j ) = 5 ij i,j=l, . . ., N. (2.2) 

Then (2.1) implies 

(Au^^Xidij i,j=l, . . ., N. (2.3) 

We suppose that the Rayleigh-Ritz method 
[1, 5, 19] is applied to find upper bounds 



K«>Aa 



a=l, . . ., M 



(2.4) 



for the first M<^N eigenvalues of A. This is done 
by choosing M linearly independent vectors 
Vi, . . . , v M and finding the roots k\<k 2 < . . . <k m 
of the secular equation 

det [(Av a vp) — K(va,vp)] = a,0=l, . . ., M. (2. 5) 

Associated with each of the k u is a linear combination 



M 

0=1 



(2.6) 



of unit length such that the M linear equations 



(Aw a ,Vfi) = K a (Wa,V(i) 

are satisfied. Then 



and 



(Aw a ,wp) = K a 8 a 



",P=h 



., M. (2.7) 



(2.8) 



M. (2.9) 



We assume that in addition to the upper bounds 
K a we have determined some lower bounds X* such that 



X ? ;> X 7 ; 



= 1, . . .,N. 



(2.10) 



This means that we have at least some idea of the 
degree of approximation of the K a to the eigenvalues X a 

We seek to determine from this degree of approxi- 
mation of the K a to the eigenvalues the degree of ap- 
proximation of a particular Rayleigh-Ritz eigenvector 
w p to the eigenvector u P of A. 

We assume _that X z _ 1 <X 2? <X /?+1 and that the 

are sufficiently good that 

(2.11) 

(2.12) 

Our problem is the following: Given the M vectors 
w lt . . . , % satisfying (2.8), find the largest value 
of the deviation (w p —u Pt w p —u p ) of w p from the pth 
normalized eigenvector of any matrix A satisfying 
the eq (2.9) and having eigenvalues \ t satisfying 
(2.10). 

If the eigenvalues \ t of A are given, A is completely 
specified by prescribing its normalized eigenvectors. 
These eigenvectors u x t . . . , u N form a basis, so that 
the w a can be written as linear combinations of them. 
Let 



Kp-l 


K p 


\ p 


and X p+ i are 

Kp\Xp+i 
K p -i<^\ p . 



N 

w a =Y] a a Ui 



x=l. 



.,M. 



(2.13) 



Since the u t satisfy (2.2) and (2.3), the eqs (2.8) 
and (2.9) become 

N 

2_, a a dp=8ap 



and 



TJ \a l a al=K a b a $ «,/3=l, . . ., M. 



On the other hand, 

{w p —u Pt w p — u p )=2(l— a v p ). 



(2.14) 
(2.15) 

(2.16) 



If Up satisfies (2.2) and (2.3), so does —u p . We 
choose the sign to make a p p non-negative. Thus, our 



problem of maximizing (w % 



w p —u p ) is reduced 



to that of minimizing (a v p ) 2 under the constraints 

(2.14), (2.15), and (2.10). 
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We first keep the eigenvalues \ t fixed and unequal. 
If a* is the minimizing set of coefficients, we find by 
direct differentiation that the equations 



M 



afrp&ap+lt, (r a p+*iSafi)a>fi=Q 



P=l 



i=l,...,N (2.17) 



must be satisfied. The r a p and s al3 are Lagrange mul- 
tipliers, and are symmetric: 






(2.18) 



To solve the eq (2.17), we multiply by a\ and sum 
with respect to i. Using (2.14) and (2.15) we find 
that 

ala*d ap +r a y+KyS ay =0. (2.19) 

Interchanging a and 7 and using (2.18), we find 

7'a7 = \K a Ky) K pdp[0-'yO a p d a y ; ,J, 

Say = (K a —K y )~ l a p p [a^8 ap —a v J yp ] a^y. (2.20) 

In particular, r ay and s ay vanish unless a =y or either 
a or 7 is p. 

Letting a=y in (2.19), we find 

r a a+fc a s aa =-(a;>5 ttJ ,. (2.21) 

We substitute (2.20) and (2.21) in (2.17) to obtain 

(K p — K a )(\t— Ka)«aaai+(\i-K p )al<l P a ai=0 

and 



a^p 

(2.22) 



jSr^p «/3 ftp 

= 0, i=l,...,iV. (2.23) 

If the value a v v =0 is compatible with the con- 
straints (2.14) and(2.15), the minimum value of (a£) 2 
is clearly zero. We suppose for the moment that this 
is not the case, so that 



a% ^0. 



(2.24) 



Then we can solve (2.23) with i=p for s pp in terms 
of the a%. We can also eliminate s aa between (2.22) 
with i=p and any other value of i to obtain 

<»t" ^!\"l^ (a ? )2 - (2 - 25) 

Substituting this andlthe value of s pp in (2.23) we 
obtain 



\ m ^p~~ K p\- /3=1 p = lAi—Kp JJ 

=0, i^2>. (2.26) 

Suppose now that exactly L of the coefficients a\ 
are nonzero: 



ag^O for /3=ft. &, . . ., ft; 
fc<ft< • • • <^l. 



(2.27) 



By (2.24) one of the @ v =p. The term in braces in 
(2.26) times the product of the (X<— K0„) with & v ?±p 
is a polynomial of degree L in X*. Hence it vanishes 
for at most L values of i. Consequently, a\ ^0 for 

L<L values of i^p. 

From (2.22) it follows that a* =0 implies s aa a l a =0 
for all a. But s aa =0 implies a*=0. Hence a l p = 
implies ajj„ = 0. Thus our vectors w a split into two 
subsets: The subset Si consists of L orthonormal vec- 
tors tify. . . ., Wfi L (including w p ) having only com- 
ponents in the Z+l directions u ht . . ., ^i£ +1 - The 
subset £ 2 consists of M— L — 1 vectors orthogonal 
to w p . 

It follows that L must be either L or Z— 1. In 
the latter case, the L vectors w fiy in the first set will 
be the eigenvectors u tv . . . , u iL themselves. In 
particular, w p is u p so that the corresponding maxi- 
mum of (w p —u p< w p —u p ) is zero. This occurs if and 
only if k p = \ p . The more interesting case is that in 
which 

L=L. (2.28) 

The term in the braces in (2.26) vanishes for 
i=i lt i 2t . . ., i L+1 (ii<i2< . • • <i L +i) except when 
i v =p. This represents a set of L linear equations in 
the L unknowns (a^) 2 . . . , {af L ) 2 - It can be solved 
explicitly to yield 









(2.29) 



" K^p 



(This result can be checked by the Lagrange inter- 
polation formula [17]. A similar solution of a closely 
related equation was given by K. Loewner, Math. 
Z. 38, 180-181 (1934).) 

We now let &=fa in (2.25), multiply by a;, and 
sum. By (2.14) we have 



(aS') 2 +rbS>;) 2 =s 



i r ^pX* r — Kft 



Kfc, 



M = l L. 

(2.30) 

Again, we can solve explicitly for the L unknowns 
(a^Y,i T ^p. Using (2.25) and (2.29), we find that 



«) 2 = 






(2.31) 
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The solution (2.31) formally satisfies the conditions 
(2.14), (2.15), and (2.17). (The square roots must 
be chosen so that a /% is (\ ia — k/O -1 times a function 
of i a only times a function of ft only.) In order to 
be admissible, the coefficients must be real. It is 
easily seen that this is the case if and only if 



*i,<K/8j<Xtj< • • • <Kj8 L Oi 



(2.32) 



The vectors wp in the subset S 2 are orthogonal to 
Wp . . ., Wp L and u p . It follows from (2.31) that 
they are orthogonal to u ilt . . . ,^i L+v This is pos- 
sible if and only if to each i with i^h, . . ., i L +i 
there corresponds a separate k& with 07^ft, . . ., ($ L 
such that Kp>\i. 

We now consider the possible minima of (a p p ) 2 . 
Choosing a particular set of ft and i 9 , we find from 
(2.31) that 



(a?) 2 - n 



"% n *p—\ 



v9 £P K p K V i T ^p *p \ 



(2.33) 



Because of (2.32) 



,fa-^)(y-^)<l (2.34) 



( K P"- K ^)( X P- X ^) (*P— K0,)( X P~~ X O 

for i„># or ft<#. 

Thus, the right-hand side of (2.33) is increased 
by dropping any pair \ (v , k$ v . This means that its 
minimum will be attained when the sets i lt . . ., 
izrWi ft, • • •> ft. are maximal with respect to the 
properties required of them. We further note that 
the right-hand side of (2.33) is an increasing function 
of \* T for i T 9^p and a decreasing function of Kp p for 

Keeping these facts in mind, we construct the 
sets i lt . . ., i L+1 , ft, . . ., L which minimize 
(al) 2 for fixed unequal \ t as follows. 



Let 



Let 



Let 



h=l. 



fc=min{t|303Xi<ty<X«}. 



ft=max{j8|/c i8 <X f2 }. 
Then inductively, let 

^ +1 =min{i|3K0 3 K<Kp<\i 
j8,=max{/J|i^<X v+1 }. 



(2.35) 



(2.36) 



(2.37) 



(2.38) 



Because of (2.11) the set of ft includes #. If p is 
not included in the i 9 , we can easily construct a 
solution of the eqs (2.14) and (2.15) with a?=0. 
To do this we define the (ajj) 2 by (2.31). Then 

Conversely, if (2.24) is violated so that the mini- 
mum of (al) 2 is zero, we can consider the problem of 
minimizing a coefficient (al) 2 with g<ip under the 
constraints (2.14), (2.15), and a l v = for some set of 
i<ip including^. Since k p <^\ p+) , not all the al can 
vanish. Therefore this minimum problem will have a 
non-zero solution for some g: and some set of i<p. 
The minimizing conditions again lead to the deter- 
mination of sets i 9 and ft by (2.35), (2.36), (2.37), 
and (2.38). The integer p is included in the ft but 
not in the i 9 . 

Thus, al 7^0 if and only if p is one of the i 9 . It 
follows from the construction of the i 9 that this 
will be the case if and only if there is a up such that 
\ p -i<C^<O^P' This is assured by (2.12) for any X* 
satisfying (2.10). Condition (2.12) is therefore 
necessary and sufficient for (al) 2 to have a nonzero 
minimum. 

The minimum value of (a%) 2 is now given by 
(2.33). It is a continuous nondecreasing function 
of the eigenvalues X*. Hence its minimum with 
respect to the X* satisfying (2.10) will occur for 
\.= \.. We may remove the assumption that the 
Xj are unequal by a limiting process. This will alter 
the inequalities in (2.37) and (2.38) slightly. 

As we pointed out in the introduction, we can 
replace the matrix A by a symmetric operator on a 
Hilbert space. We need only assume that A has 
at least p discrete eigenvalues defined by a minimum 
maximum principle [1,5,19]. For then the first N 
eigenvalues of A and their corresponding eigen- 
vectors are uniformly approximated by those of an 
N'XN' matrix A N * for N' sufficiently large. If 
\ f i=l, .. . . , N are lower bounds for the first N 
eigenvalues of A, there is an e N > which goes to zero 
as iV— ><» such that X<— e N > are lower _bounds for 
the first N eigenvalues of A N >. Also, \ N —e N > is a 
lower bound for the higher eigenvalues of A N >. 
We let 2V'-^oo for fixed M and N. Using (2.33) 
and (2.34), we obtain the following theorem. 

Theorem 1. Let k 1 <k 2 < . . . <k m be the 
Rayleigh-Ritz upper bounds for the Jirst M of the 
eigenvalues Xi<X 2 < . . . of a symmetric linear 
operator A. Let \ x < . . . <\ N be lower bounds 
for the first N eigenvalues of A, N>M. 



Let 



ac 7 ,_ 1 <X 2 ,</c 2 ,<X 



A^N^P+l- 



(2.39) 



Define the numbers 
ii<i 2 < . . . <i L +i, ft<ft< • . . <ft, by 

i x = l 

i 2 =min{i|Xj>/ci} 
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iSi = max{j8|ic / 5<X f2 } 
i + i=min {i|aftX<„<K/j<Xf 



(2.40) 



Then if w p is the normalized Rayleigh-Ritz eigen- 
vector corresponding to the bound k p and u p is the 
normalized eigenvector of A corresponding to the 
eigenvalue \ p , 



I 1 ~2 ( w p— u p> w p— u p)\ 




(>-<) (V~>* F )j 
(2.41) 



The right-hand side of this inequality approaches 
zero if either k p approaches \ p+1 or k p ^ approaches \ p . 
It is near one if the error k p —\ p is small relative to 
the approximate spacing X ilrfl — X p and if the prod- 
ucts of errors (k p — X„) (>%— X <j( ) are small relative to 
the products of approximate spacings (k p — k^ (X p — X^). 

If lower bounds \ t are not given for all the eigen- 
values X/ ( we can always use a lower bound for any 
particular eigenvalue as a lower bound for a higher 
eigenvalue. In particular, we can let \ t = X p+1 for i^>p 
and X z =Xx for i<Cp. If 2?>1, (2.39) requires that 
/Cp_!<X p . Then L=2, ii=l, Pi=p— 1, i 2 =P2=p, 
h=p + l. This leads to the simpler bound 



I 1— -(w p — m p w v —u v ) 

f 1 *?— ^ \ / 1 _ ( /c p~ Q(* P -i— xQ \ 

\ X^+i— X,/ \ (^-^-i)(x p -Xi)J ^' 



> 



42) 



Even though this bound has fewer factors than (2.41), 
it is, in general, smaller. 

Forp=l we can take X*=X 2 for i>2. Then L=l, 
ii=ft=l, ^2=2, and (2.41) reduces to (1.6). 

Example. We apply the Rayleigh-Ritz method to 
the matrix 



A= 



r l 1 


(T 


1 3 


1 


^0 1 


23^ 



(2.43) 



using trial vectors with vanishing third component. 
This amounts to finding the eigenvalues and eigen- 
vectors of the 2X2 matrix obtained by striking out 
the third row and column of A. We obtain 



Kl =2— v/2",k 2 =2+V2 



(2.44) 



with the corresponding vectors 

^^(I(2+V2) 1/2 4(2-V2) 1/2 ,0) 

^ 2 ^Q(2-V2) 1/2 ,i(2+V2) 1/2 ,0) 



(2.45) 



If the first two coordinate directions are replaced 
by the w x and w 2 directions, the matrix A ^becomes 



A*= 



M 







*2 



:*i 



1/2 



~2« 2 



-W' 2 



\+" 23 



(2.46) 



We now obtain lower bounds forjthe ^eigenvalues 
by means of Hadamard's theorem <on determinants 
[16]. 



Xi=ki— -ici 1/2 , 



= K2-^2 1/2 . 



(2.47) 



For X 3 we use the fact that the largest eigenvalue 
must exceed the largest diagonal element. Then 

X 3 =23. (2.48) 

Substituting these values in (2.41) we find that 
(wi-^i, Wi-^i)<0.08025, 



(w 2 -u 2 , w 2 -^ 2 )<0.09980. 



(2.49) 



In this problem one can, of course, determine the 
eigenvalues and the corresponding eigenvectors ex- 
plicitly. We find that 



{w l —u lt w 1 -^ 1 ) = 0.000328, 

(w 2 -u 2t w 2 -u 2 ) =0.002238. 



(2.50) 



Thus, the error bounds (2.49) are a good deal larger 
than the errors themselves. _ 

Our error bounds depend upon the lower bounds X*. 
These were chosen rather crudely and could be un- 
proved in various ways (see, for example [5]). In 
order to determine the effect of such an improvement, 
we replace the X* by the eigenvalues X* in (2.41). 
We then obtain the bounds 



(w x -u u W!-%)<0.000330, 
(w 2 -u 2 , w 2 -u 2 )< 0.002243. 



(2.51) 
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These are very close to the actual values (2.50). 
The simpler bound (1.6) gives 

(Wi—ut, w t — Ui) <0. 195136 (2.52) 

if the values (2.47) of X* are used, and 

(Wi—v,!, w l —u 1 )<0. 002364 (2.53) 

if they are replaced by the X* themselves. In both 
cases we see that the bound (2.41) is significantly 
better than the simplified bound (1.6). 

3. Neighboring Eigenvalues 

The condition (2.39) implies that the eigenvalue \ p 
is simple. If this is not the case, the corresponding 
eigenvector u p is not uniquely defined. In fact, if \ p 
has multiplicity m, u p may be any element of an 
m-space. If m>l, there will always be such a u p 
orthogonal to w p , so that the minimum of (w p ,u p ) 
is zero. 

We must reformulate our problem. We seek the 
minimum value of (w p ,u p ) 2 when u p is taken to be 
that element of the m-space which best approximates 
w p . This u p is the projection of w p into the m-space 
of eigenvectors corresponding to X^,. 

The condition (2.39) implies not only that X^, is 
simple, but that our bounds k p -i, k p , \ py and X p+i are 
good enough to reveal its simplicity. That is, the 
error in our bounds is smaller than the separations 
between X p _i, \ p , and X p+1 . If this is not the case, 
we cannot distinguish between a simple and a mul- 
tiple eigenvalue. 

Suppose now that the upper and lower bounds for 
\ p , \ p +i, . . ., Xp +W _i show these eigenvalues to lie 
close together. Suppose further that 



<p-i 



? ^^^P) Kp+m-l ^^^U 



(3.1) 



so that Xp_i and \ p + m are known to lie away from the 
cluster of eigenvalues about X^,. Then, to our degree 
of approximation \ p behaves like an eigenvalue of 
multiplicity m. We ask how well w p can be ap- 
proximated by a linear combination of unit length 
of the eigenvectors u p , u p+1 , . . ., u p+m -i. 

This problem is equivalent to that of minimizing 

p +m— 1 

X) (K) 2 under the constraints (2.14), (2.15), and 

v 
(2.10). By (2.20) we have r a(3 =s a f3=0 unless a=j3 
or a. or p=p. Moreover, r ap +K p s ap =0, r aa +K a s aa =0 
for a^p. This means that we would obtain the 
same minimizing conditions by imposing only the 
constraints (2.14) with a=/3 or a or P=p and the 
single constraint (2.15) with a=P=p. The latter 
may even be replaced by the inequality 



52Hal) 2 <K p . 



(3.2) 



The other constraints (2.14) and (2.15) determine 
which local minima actually occur, but the local 



minima themselves are determined by (2.14) and 
(3.2). 

The same situation applies in the case under con- 
sideration here. Necessary conditions for a maxi- 

p-j-m—l 

mum of XI (&p) 2 are determined by the constraints 

(2.14), (2.10), and (3.2). 

Let u p be the unit vector in the direction of the 
projection of w p into the space spanned by u p , . . ., 



v p+m—l' 



{p+ra-1 *"\- 

5 {aiY s 



1/2 p+m-1 

2 <u t . (3.3) 

p 



Let u p+u . . ., u p+m ^i be other linear combinations of 
u p , . . ., Wp+w-i such that u p , . . ., u p+m -i are 
orthonormal. Let Ui=u t for i^p, . . ., p+m—1 
and put 

a^iwa, u t ). (3.4) 

Then by construction 

al = i=p+l, . . ., p + m-1 (3.5) 



and 



Moreover, 



i=p 



(3.6) 



&M) 2 >^UaiY+K(al) 2 +j:\ { (ai) 2 . (3.7) 

2=1 1 p+m 



Thus, if we let 



x,= 



i=p, . . ., p + m—1 



j k p i=p, . . . 
I \ t otherwise. 



We have from (3.2) and (3.7) that 



N 



T,h(aiY<K P . 



(3.8) 



(3.9) 



Since the u t are orthonormal, (2.14) becomes 

2Xa*=a«* (3.io) 

Our problem is thus reduced to minimizing (aj) 2 
under the constraints (3.10), (3.9), (2.10), and (3.5). 
The conditions for local minima are found to be as in 
section 2. However, the constraints (3.5) together 
with the fact that K p + m -i<^\ p + m relegate the vectors 
Wpn, . . ., w p+m _i to the set S 2 orthogonal to u p . 
Furthermore, the conditions (3.5) eliminate the 
values i=p J r l, . . ., p+m — 1 from the v 

Thus we find the following theorem. 

Theorem 2. Let ki<k 2 < . . . <k m be the Ray- 
leigh-Ritz upper bounds for the first M eigenvalues 
Xi<Xa< . . . of an hermitian operator A. Let 
Xi< . . . <\ N be lower bounds for the first N>M 
eigenvalues. For a certain p and m let 
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K p-l < ^^py 



(3.11) 



Define the numbers i u ?* 2 , . . ., Il+u ft> • • •> ft, 
6y (2.40) w^A Xp+i, . . ., Xp+w-i eliminated from the 
set of X*, and k p + u . . ., K p+m -i eliminated from the set 

of K a . 

Let w p be the Rayleigh-Ritz eigenvector that gives 
the bound k p . Then there exists a linear combination oj 
unit length u p of the eigenvectors u p , . . ., *u P+m -i of A 
such that 



* r (k p -k)(^-k) *\ (3 



0^p 



12) 



As in section 2 we obtain a simpler but a weaker 
inequality by putting { \ i =\ p+m for i^>p-\-m and 
\t=\i for i<^p when p^>\. 
This leads to 



N 



{W V — U V ,V} I> ~U^ 



T>{i- - K '~ x i -} 

{ 1 (k p — X p )(/c p -i — Xi) \ 
(k p — Kp_i)(Xj,— Xi) J 



(3.13) 



Forp=l we only have to put \i=\ p + m for iy>p + m to 
obtain 

fl— |(tgi— &i,tl^— flt)T>l— ,- Jft ~ Xl - - (3.14) 

L -I X m+ i — Xi 

By the same reasoning we can show that there is a 
linear combination u p+g of u p , . . ., u p + m _i that 
approximates w p+g with 0<#<ra. We eliminate 
Kp, . . ., K p + m _i except for k p+q from the K a and 
Xp+i, . . ., \ p+m -i from the X z in forming the sets 
p v and i,. Then we obtain the inequalities (3.12) 
and (3.13) with w p replaced by w p + g , u p by w p+3 , and 



by k 



p+q- 



Example. We consider the vibrations of a uni- 
form beam which is free at its ends and which lies on 
an elastic foundation with small linearly varying 
elastic constant. It satisfies the differential equation 



u lv + exu=\u 0<#<1 



(3.15) 



with the end conditions 



u"(0)=u'"(0) = u"(l)=u"'(l) = (3.16) 

The constant e is positive and small. We introduce 
the scalar product 



fa. 



••>-/.' 



uvdx 



(3.17) 



and define the symmetric operator A by the bilinear 
form 

(Au, v)= {\u"v" + euv]dx (3.18) 

If the eigenvalues \ x < X 2 < . . . of A are defined as 
the successive minima of the Rayleigh quotient 
(Av, v)/(v, v), they coincide with those of the problem 
(3.15), (3.16). 

We are concerned with the two lowest eigenvalues. 
For e=0 they both vanish. Corresponding eigen- 
f unctions are 1 and 6~ 1/2 (1— 2x). We use these as 
trial functions in the Rayleigh-Ritz method. We 
obtain 

k x =\{1 -6-^)6, 

K 2 =l(l + 6-^)e f (3.19) 

withjthe corresponding vectors 

W 1 =2' 1 ^[l-Q- l/2 (l-2x)] 

^ 2 =2- 1 / 2 [l + 6- 1/2 (l-2a:)] (3.20) 

To obtain lower bounds A* we note that (Av, v) |is 
greater for e>0 than for € = 0. Thus, «the X* are 
bounded below by the eigenvalues \ t of the problem 
(3.15), (3.16) with e=0. These can be found ex- 
plicitly. We find 



Xi=X 2 =0 
X 3 =500.462. 



(3.21) 



Condition (2.39) is violated so that we cannot say 
how well w x approximates u x without improving our 
bounds. However, we can use theorem 2 to state 
that there are linear combinations u x and u 2 of u x 
and u 2 such that 

P (^ 1 -^i) 2 &<2[l-{l-0.0005912e} 1/2 ], 

(w 2 -u 2 ) 2 dx<2[l-{l-0.00U070e} V2 ]. 

(3.22) 

Thus we have shown that w x and w 2 approximate 
linear combinations of u x and u 2 in the mean square 
sense. 

When, as in this example, A is unbounded, it is 
often more desirable to have a bound for the devia- 
tion (A(w p —Up)Wp—u p ) rather than (w p —u p ,w p —u p ) 
In order to obtain such a bound we note that the 
quadratic form (Av,v)+c(v,v) is positive definite for 
c>-Ai. 
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Hence we can define a new scalar product 

[u,v] = (Au,v) +c(u,v) . (3.23) 

A 

We now define the operator A by 

[Au,v]= — (u,v). (3.24) 

Then the eigenvectors of A are multiples of those of 
A, and its eigenvalues are — (Xi+c)" 1 . Applying 
theorems 1 and 2 to A and expressing the results in 

terms of A, we find the following. 

Theorem 3. Under the hypotheses of theorem 2 there 
exists for any constant c>— Xi a, linear combination u of 



and 



, u p+m -i such that 

(Au,u) +c(u,u) =k p +c 



[^ 



i 



(3.25) 

p —u) + c(w p —u,w p —u)} 

>(K P +c)(\ P +c)fi- * p ~\ \ n 



+ c+- {(Aw p —Au,w, 



{ 



(« 



Xp) (.K^—Xi) 



(Kp—K0 t )(\p—X 



-^}(3.26) 



When the multiplicity m=l, w is a multiple of u p 
and we have the analog of theorem 1. 

Applying theorem 3 to the example (3.15), (3.16), 
we obtain the inequality 

r ^ ,,2 +(c+€^)(^ 1 -^) 2 ]&< 

2[c+1.704 € -c 1/2 {c+1.704e} 1/2 {l-0.0005912 € } 1/2 ] 

(3.27) 

for any c>0. (The function u depends upon c, how- 
ever.) 

4. Point wise Bounds for Eigenf unctions 

When A is a differential operator, theorems 1 and 
3 give bounds for the mean square deviation of the 
approximate eigenfunction w p from the exact eigen- 
function u p . It is often of interest to determine the 
value of u p at a particular point. 

In certain cases a pointwise bound for the devia- 
tion \w p — u\&t a point comes directly from the bound 
(3.26) of theorem 3. For example, we show that for 
any 0<f<l 



M*)-&ft)| 2 < 



where 



G(U) £ [(«*- 



-'S) //2 +c('^ 1 — u) 2 ]dx, 
(4.1) 



Of(j,©=i6-3[sinh 2 6-sin 2 6]- 



X[sinh 6 cosh b— sin b cos b 

+2 sinh b cosh 6(1— 2£) —2 sin b cos b (1 — 2?) 

+ (sinh b cos 6+ sin b cosh b) sin 6(1 

- 2?) sinh 6 (1 — 2?) + (sinh & cos 6 

-sin b cosh 6) cos 6(1 -2 J) cosh 6(1-20] 



<bj 



(4.2) 



(See, for example, [8]). Thus, (3.27) gives a bound 
for \vh(Q-u(Q\. 

In the case of partial differential operators such a 
bound may or may not exist. If it exists, it is diffi- 
cult to find. 

However, one can use the following adaptation of 
the method of Diaz and Greenberg [7,9] )cf. [15,18]). 
For the sake of simplicity we present it only for the 
case of a special second order operator in two dimen- 
sions. 

Consider the eigenvalue problem 



—Au-\-r(x,y)u=\q(x,y)u 



(4.3) 



on a two-dimensional domain D with smooth bound- 
ary C. Here A is the usual Laplace operator. The 
function q is positive and r is non-negative, and both 
are continuous in the closure of D. The boundary O 
consist of two parts Oi and C 2 , and we have boundary 
conditions 



^=0 
-^+k(x,y)u=0 



on d 
on C 2 , 



We define the scalar product 
(u, v)= \ I quvdxdy 



k>0. (4.4) 



(4.5) 



on the linear vector space of functions which are 
piecewise continuously differentiable in D and vanish 
on d. 

Let u p be the normalized eigenfunction correspond- 
ing to the eigenvalue X p , and let w p be the function 
corresponding to a Kayleigh-Ritz upper bound 
k p for \ p . Theorem 1 gives a bound for the deviation 
in norm (w p —u p , w p —u p ). 

We wish to approximate the value of u p at an 
interior point of D, which we choose as the origin of 
our coordinate system. We use the fact that u v 
satisfies the differential equation (4.3) with \=\ p and 
the boundary conditions (4.4). Let T(x, y) be a 
parametrix for the differential equation (4.3) satisfy- 
ing (4.4). That is, 

rfo y) = - (4tt) - 1 log (x 2 +y 2 ) +4>(z, y) , (4.6) 

where <j> is any twice continuously differentiable 
function such that 
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2»r 



dn 



T=0 on Ci 



HT=0 on C 2 



(4.7) 



Multiplying (4.3) by T and integrating by parts, 
we find that 

Up(0, 0)=jju p [-A<t>+(r-\ p q)T]dxdy. (4.8 

Z> 

Replacing u p by w p and X p by k v on the right, we 
obtain the value 

<(0, 0)= fJ^[-A*+(r-^2)r](fedy (4.9) 



which can be computed by quadratures. Using 
Schwarz's inequality, the normalization of u p , and 
the triangle inequality, we find 

|<(o, o)-u p (o, 0)1 <0c,-x,)(r, r) 1/2 



+ (w p — u p , w p —u p ) 1/2 



^ffr l l-A*+(r-KM*dxdy\- V% (4. 



10) 



Thus, the bound (2.41) for (w p —u p , w p —u p ) together 
with the bound k p —\ p for k p —\ p provides explicit 
upper and lower bounds for u p (0, 0). These bounds 
lie close together if the error bounds (2.41) and k p —\ p 
are small. 

The same method applies to the function u p of 
theorem 2. If 



p +m — l 



(4.11) 



we find from (4.8) that 



u>p(0, 0) = 2>* \\u i [-A<t>+{r-\ i q)Y}dxdy. 

i) 

(4.12) 

Hence if we again define the approximate value w% by 
(4.9), we obtain 

\w*(0, 0) — u p {0, 0)|< msix{K p — \ P) K p+m -l — K p ) 



(r, vy /2 +(w p -u p , w p -u p y 12 



(Sk 



} 



1/2 



[-A^+(r-K p q)T] 2 dxdy X • (4.13) 



This inequality together with theorem 2 gives 
upper and lower bounds for u p (0, 0). 
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